AN EXPLICIT COUNTEREXAMPLE TO THE LAGARIAS-WANG 
FINITENESS CONJECTURE 



KEVIN G. HARE, IAN D. MORRIS, NIKITA SIDOROV, AND JACQUES THEYS 
To the memory of I. M. Gelfand 

Abstract. The joint spectral radius of a finite set of real d X d matrices is defined to be the 
maximum possible exponential rate of growth of long products of matrices drawn from that set. 
A set of matrices is said to have the finiteness property if there exists a periodic product which 
achieves this maximal rate of growth. J. C. Lagarias and Y. Wang conjectured in 1995 that 
every finite set of real d X d matrices satisfies the finiteness property. However, T. Bousch and 
J. Mairesse proved in 2002 that counterexamples to the finiteness conjecture exist, showing in 
particular that there exists a family of pairs of 2 X 2 matrices which contains a counterexample. 
Similar results were subsequently given by V. D. Blondel, J. Theys and A. A. Vladimirov and 
by V. S. Kozyakin, but no explicit counterexample to the finiteness conjecture has so far been 
given. The purpose of this paper is to resolve this issue by giving the first completely explicit 
description of a counterexample to the Lagarias- Wang finiteness conjecture. Namely, for the set 

1 1 A / 1 



1 J \ 1 1 
we give an explicit value of 

a, ~ 0.749326546330367557943961948091344672091327370236064317358024 . 
such that Aoi, does not satisfy the finiteness property. 



1. Introduction 

If A is a d X d real or complex matrix and || • || is a matrix norm, the spectral radius p{A) of 
the matrix A admits the well-known characterisation 

p{A) = lim 1 1 A" 11^/", 

?i— >-oo 

a result known as Gelfand's formula. The joint spectral radius generalises this concept to sets of 
matrices. Given a finite set oi d x d real matrices A = {^i, • • ■ ,Ar}, we by analogy define the 
joint spectral radius g{A.) to be the quantity 

g{A) := limsupmaxj IjAij • • • AiJ\^^" : ij e {1, . . . , r} i , 

a definition introduced by G.-C. Rota and G. Strang in 1960 [45] (reprinted in [44J. Note that 
the pairwise equivalence of norms on finite dimensional spaces implies that the quantity g{A) is 
independent of the choice of norm used in the definition. 

The joint spectral radius has been found to arise naturally in a range of mathematical contexts 
including control and stability [U [TTJ [20l , coding theory [37] , the regularity of wavelets and 
other fractal structures [12[ [T3| [36| [42] , numerical solutions to ordinary differential equations [19] , 
and combinatorics [U [Tl] . As such the problem of accurately estimating the joint spectral radius 
of a given finite set of matrices is a topic of ongoing research interest [5] [THJ [SU [SIl IMl HO! UHl US] ■ 
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In this paper we study a property related to the computation of the joint spectral radius of a set 
of maXrices, termed the finiteness property. A set of c? x d real matrices A := {Ai, . . . , A^} is said to 
satisfy the finiteness property if there exist integers ii, . . . , i„ such that g{A) = p{Ai-^ ■ ■ ■ Ai^Y^^. 
The finiteness conjecture of J. Lagarias and Y. Wang [34] asserted that every finite set oi dx d real 
matrices has the finiteness property; a conjecture equivalent to this statement was independently 
posed by L. Gurvits in [20J, where it was attributed to E. S. Pyatnitskii. In special cases, this 
finiteness property is known to be true, see for example [lOl [25] . The existence of counterexamples 
to the finiteness conjecture was established in 2002 by T. Bousch and J. Mairesse |8], with alter- 
native constructions subsequently being given by V. Blondel, J. Theys and A. Vladimirov |6| and 
V. S. Kozyakin However, in all three of these proofs it is shown only that a certain family of 
pairs of 2 X 2 matrices must contain a counterexample, and no explicit counterexample has yet been 
constructed. The problem of constructing an explicit counterexample has been remarked upon as 
difficult, with G. Strang commenting that an explicit counterexample may never be established 
|44j . In this paper, we resolve this issue by giving the first completely explicit construction of a 
counterexample to the Lagarias- Wang finiteness conjecture. 

Let us define a pair of 2 x 2 real matrices by 

and for each a € [0,1] let us define Aq, := {Ao,aAi}. The construction of Blondel- Theys- 
Vladimirov [5] shows that there exists a € [0, 1] for which A^ does not satisfy the finiteness 
property. The proof operates indirectly by demonstrating that the set of all parameter values a 
for which the finiteness property does hold is insufficient to cover the interval [0, 1]. In this paper 
we extend substantially by describing the behaviour of Q{l^a) as the parameter a is varied in 
a rather deep manner. This allows us to prove the following theorem: 

Theorem 1.1. Let (t„)^q denote the sequence of integers defined by tq :— 1, ti,T2 :— 2, and 
Tn+i '■= TnTn-i — T„_2 fof 0,^1 n > 20 and let (-F'n)^o denote the sequence of Fibonacci numbers, 
defined by Fq := 0, Fi :— 1 and :— F„ + Fn-i for all n > 1. Define a real number a* G (0, 1] 
by 

(1.1) a. hm i^) = TT 1 - 

Then this infinite product converges unconditionally, and A^^ does not have the finiteness property. 

The convergence in both of the limits given in Theorem 11.11 is extremely rapid, being of or- 
der O (exp(— J^")) where (5 > is some constant and </> is the golden ratio. An explicit error 
bound is given subsequently to the proof of Theorem 11.11 Using this bound we may compute the 
approximation 

a* ~ 0.749326546330367557943961948091344672091327370236064317358024 . . . 

which is rigorously accurate to all decimal places shown. 

We shall now briefly describe the technical results which underlie the proof of Theorem ll.il For 

each a £ [0, 1] let us write Ag"'' :— Aq and A^-^^ :— aA\ so that A^ = |ylg"'',yl^"''|. The principal 
technical question which is addressed in this paper is the following: if we are given that for some 
finite sequence of values u\, . . . , u„ e {0, 1}, the matrix 

(1-2) ^^jAi-j^^-.-AWAif 

is "large" in some suitable sense - for example, if its spectral radius is close to the value Q^^aT" 
- then what may we deduce about the combinatorial structure of the sequence of values Ui , and 
how does this answer change as the parameter a. is varied? A key technical step in the proof of 
Theorem ll.il therefore, is to show that the magnitude of the product (|1.2p is maximised when the 
sequence ui,U2, . . . ,Un is a balanced word. This result depends in a rather essential manner on 



^This is the sequence A022405 from Sloane's On-Line Encyclopedia of Integer Sequences. 
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several otherwise unpublished results from the fourth named author's 2005 PhD thesis [17], which 
are substantially strengthened in the present paper. 

In the following section we shall introduce the combinatorial ideas needed to describe balanced 
words. We are then able to state our main technical theorem, describe its relationship to previous 
research in ergodic theory and the theory of the joint spectral radius, and give a brief overview 
of how Theorem 1 1.1 1 is subsequently deduced. The detailed structure of this paper is described at 
the end of the following section. 



2. Notation and statement of technical results 

Throughout this paper we denote the set of all d x d real matrices by M(i(K). The symbol ||| • ||| 
will be used to denote the norm on Mrf(M) which is induced by the Euclidean norm on M'', which 
satisfies |||S||| = p{B*Bf/'^ for every B € M(j(M). Other norms shall be denoted using the symbol 
II • 1]. We shall say that a norm || • || on M2(M) is submultiplicative if ||^-B|| < \\A\\ ■ \\B\\ for all 
A,Bg M2(K). For the remainder of this paper we shall also denote g{Aa) simply by g{a). 

For the purposes of this paper we define a finite word, or simply word to be sequence u = (ui) 
belonging to {0, 1}" for some integer n > 0. We will typically use u, v or w to represent finite 
words. If u e {0, 1}" then we say that u is length n, which we denote by |u| = n. If |m| is zero 
then the word u is called empty. The number of terms of u which are equal to 1 is denoted by 
1^1 1. If M is nonempty, the quantity |m|i/|u| is called the 1-ratio of u and is written <;{u). The two 
possible words of length one shall often be denoted simply by and 1. We denote the set of all 
finite words by Vl. 

We will define an infinite word to be a sequence x = (xi) belonging to {0, l}^. We win typically 
use X, y or z to represent infinite words. If the word can be either finite or infinite, we will typically 
use w. We denote the set of all infinite words by E, and define a metric d on S as follows. Given 
x,y GT, with X = {xi)'^i and y — define n{x,y) :— inf{z > I: Xi ^ yt}. We now define 

d{x,y) :~ l/2"(^'^) for all x,y € E, where we interpret the symbol 1/2°° as being equal to zero. 
The topology on E which is generated by the metric d coincides with the infinite product topology 
on E = {0, 1}^. In particular E is compact and totally disconnected. For any nonempty finite 
word u — {ui)f^i the set {x Xi ~ Ui for all 1 < z < is both closed and open. Since every 
open ball in E has this form for some u, the collection of all such sets generates the topology of E. 

We define the shift transformation T : E — )■ E by T[{xi)°^i] ■— {xi+i)iZi- The shift transforma- 
tion is continuous and surjective. We define the projection 7r„ : E — > by TTn[{xi)°Zi] — {xi)'^^^. 

If M = U1U2 ■ ■ - Un and V = V1V2 ■ ■ ■ Vm are finite words, then we define the concatenation of u 
with V as uv = U1U2 ■ ■ ■ UnViV2 ■ ■ ■ Vm, the finite word of length n + m. Note that if u is the empty 
word then uv = vu = v for every word v. The set endowed with the operation of concatenation 
is a semigroup. 

Given a word u and positive integer n we let u" denote the linear concatenation of n copies of 
u, so that for example := uuuu. If u is a nonempty word of length n, we let u°° denote the 
unique infinite word a; G E such that Xkn+i — Ui for all integers i, k with fc > 1 and 1 < i < n. 
Glearly any infinite word a; S E satisfies T"a; — x for an integer rt > 1 if and only if there exists a 
word u such that x — u°° and \u\ divides n. 

If M is a nonempty word, and uj is either a finite or infinite word, we say that u is a suhword of 
Lo if there exists an integer fc > such that Ui = cjk+i for all integers i in the range 1 < i < |w|. We 
denote this relationship by it ^ w. Clearly u ^ a; if and only if there exist a possibly empty word 
D e O and a finite or infinite word w' such that a; = auuj' . An infinite word x is said to be recurrent 
if every finite subword u ^ x occurs as a subword of x an infinite number of times. A finite or 
infinite word w is called balanced if for every pair of finite subwords u,v such that u,f -< a; and 
|u| = |f I, we necessarily have ||u|i — |ti|i| < 1. Glearly uj is balanced if and only if every subword 
of UJ is balanced. An infinite balanced word which is not eventually periodic is called Sturmian. 

The following standard result describes the principal properties of balanced infinite words which 
will be applied in this paper: 
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Theorem 2.1. If x £ Y, is balanced then the limit (;{x) :~ lim„_>.oo ^^(7r„(a;)) exists. For each 
7 € [0, 1], let denote the set of all recurrent balanced infinite words a; G S for which (,(x) = 7. 
These sets have the following properties: 
(i). Each is compact and nonempty. 

(ii). For each 7 e [0,1] , the restriction of T to X^ is a continuous, minimal, uniquely ergodic 
transformation of Xj. If fi is the unique ergodic probability measure supported in X^, then 
^i{{x: xi = 1}) = 7. 

(Hi). If "f = p/q G [0, 1] n Q in lowest terms then the cardinality of Xj is q, and for each x G X-^ 
we have X~^ = {x,Tx, . . . ,T''~'^x}. If J £ [0, 1] \ Q then X^ is uncountably infinite. 

Example 2.2. We have X2/5 = {(00101)°°, (01010)°°, (10100)°°, (01001)°°, (10010)°°}. 

Theorem 12.11 does not appear to exist in the Hterature in the precise form given above, but it 
may be estabhshed without difficuhy by combining various results from the second chapter of |35j . 
The key step in obtaining Theorem 12. II is to show that x G X.y if and only if there exists S G [0, 1) 
such that either Xn = [{n + l)j + 5\ — [n7 + (5J , or a;„ = [(n + l)7 + (5] — [717 + ^] , see Lemmas 2.1.14 
and 2.1.15 of ^35) . Once this identification has been made, the dynamical properties of X^ under 
the shift transformation largely follow from the properties of the rotation map z 1— > z + 7 defined 
on R/Z. 

Given a nonempty finite word u — (w^)"^]^ and real number a G [0, 1], we put 

A^-Hu) ■.^Ail^Ai»l^---Ai^Mii' 

and 

A{u) Au„Au„_, ■ ■ ■ Au,Au, = A^^^u). 
For every x G S, Q G [0, 1] and n > 1 we also define 

^(")(a;,n) := yi(")(7r„(x)), A{x,n) yl(^„(a;)) = yt^^) (x, n). 
Note that the function A{x, n) satisfies the cocycle relationship 

A{x, n + m) = A{T"x, m)A{x, n) 

for every x G S, n, ?Ti > 1. 

Our main task in proving Theorem 11.11 is to characterise those infinite words a; G E for which 
A{x^n) grows rapidly in terms of the sets X^. To do this we must be able to specify what is 
meant by rapid growth. Let us therefore say that an infinite word x G S is a strongly extremal 
word for Aq, if there is a constant (5 > such that |||^'-"-'(x, n)||| > (5^?(a)" for all n>l, and weakly 
extremal for Aq, if lim„_j.oo |||yl*^")(x, n)!^^" = q{q.). It is obvious that every strongly extremal 
word is also weakly extremal. Note also that since all norms on M2(R) are equivalent, these 
definitions are unaffected if another norm || • || is substituted for ||| • |||. We shall say that r G [0, 1] 
is the unique optimal 1-ratio of Aq, if for every x G S which is weakly extremal for Aq we have 
<;(7r„(x)) — > r. Note that the existence of a unique optimal 1-ratio is a nontrivial property, and is 
shown in Theorem 12.31 For example, if A C M2(K) is a pair of isometrics then no unique optimal 
1-ratio for A exists. It is not difficult to see that if Aq has a unique optimal 1-ratio which is 
irrational, then Aq cannot satisfy the finiteness property, and it is this principle which underlies 
the present work as well as the work of Bousch-Mairesse [8j and Kozyakin f29| . 

The principal technical result of this paper is the following theorem which allows us to relate 
all of the concepts defined so far in this section: 

Theorem 2.3. There exists a continuous, non- decreasing surjection t: [0,1] -> [0, i] such that 
for each a, v{a) is the unique optimal 1-ratio of Aq,. For each a G [0, 1], every element of X^(^a) 
is strongly extremal for Aq. Moreover, for every compact set K C (0,1] there exists a constant 
Ck > 1 such that 



(2.1) 



_i p(yl(")(x,n)) |||yl(")(x,n)| 
Cj^ < — < 



£)(a)" 



< C, 



K 
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whenever a G K, x £ ^r(Q) ^.^d n > 1. Conversely, if x ^Y, is a recurrent infinite word which is 
strongly extremal for A^, then x € ^r(a) ; if x (z T, is any infinite word which is weakly extremal 
for A„ then (l/n) Y^Hl dist(T'=x, 0. 

Remark 2.4. The definition of a strongly extremal infinite word is similar to the one previously 
proposed by V. S. Kozyakin [30| . whereas the definition of a weakly extremal infinite word is 
similar to a definition used previously by the fourth named author [4 7) . In both instances the 
infinite word is simply referred to as 'extremal'. 

Remark 2.5. Note that balanced/ Sturmian words (and measures) arise as optimal trajectories in 
various optimisation problems - see, e.g., [71 [51 . 

Remark 2.6. A less general version of parts of Theorem 12.31 was proved in |47| . 

The structure of the paper is as follows: Sections [3] and |4] deal with important preliminaries, 
such as general properties of joint spectral radius and of balanced words. In Section [5] we show 
that every strongly extremal infinite word is balanced. In Section [B] we introduce an important 
auxiliary function S defined as the logarithm of the exponent growth of the norm of an arbitrary 
matrix product taken along balanced words with a fixed 1-ratio. In Section [7] we apply results 
from preceding sections to prove Theorem 12.31 Finally, in Section [8] we deduce Theorem 1 1 . 1 1 from 
Theorem 12.31 Section [9] contains some open questions and conjectures. 

We believe it is worth describing here briefiy how Theorem 12.31 leads to Theorem 11.11 Once we 
have established the existence of such a function r, we may take any irrational 7 and conclude 
that any element a of the preimage t~^(7) is a counterexample to the finiteness conjecture (since 
any weakly extremal word must be aperiodic). 

To construct a specific counterexample, we take 7 = ^^-5^ and choose the Fibonacci word Uoo 
as a strongly extremal word for this 1-ratio. Recall that Wqo = lim„ U(„), where u^i^^ = 1, U(^2) = 
and U(n+i) = ''^(n)^^(n-i) for n > 2. Now consider the morphism ft, : f2 — )■ ]V[2(K) such that 
h{0) — Aq, h{l) = Ai. Denote B„ := h{u^n))\ we thus have Bn+i = B„B„-i. One can easily show 
that tr (Bn) = Tn, the sequence described in Theorem ll.il To obtain explicit formulae for a*, we 
show that the auxiliary function S introducted in Section [HI is differentiable at 7 = '^^^ and that 
— log a* = S' \ ■ We then compute this derivative, which will yield (|l.ip . 



3. General properties of the joint spectral radius and extremal infinite words 

We shall begin with some general results concerning the joint spectral radius. The following 
characterisation of the joint spectral radius will prove useful on a number of occasions: 

Lemma 3.1. Let a G [0, 1] and let \\ ■ \\ he any suhmultiplicative matrix norm. Then: 

g{a) — inf max | ||yl(a;, n)|| : a; G S > = sup max | p {A{x, n))^^^ : x G S I . 
n>l L J „>i L J 

Proof. We review some arguments from |51[T^. Fix a G [0, 1] and a matrix norm || • ||, and define 

gt{a,\\ ■ II) :=max{||A|<..A|f II : (ii,...,i„) G {0, 1}" } = max { || A^"' (^, ")|| ■ x e J:} 
and 

Q-{a) :=max{p(A|f'...A["^) : Zi,...,i„ G {0, 1}} - max {p (a^") (x, n)) : x G e} . 

Clearly each Q+{a, || • ||) is nonzero, and £>++„(«, || • ||) < £»+(«, || • ||)£'+ (a, || • ||) for every n, m > 1. 
Applying Fekete's subadditivity lemma [TSj to the sequence \ogg^{a, || • ||) we obtain 

hm ^,+ (a, ||. 11)1/"= inf o+(a,||.||)V". 

n— ^00 n>l 

In particular the limit superior in the definition of g{a) is in fact a limit. A well-known result of 
Berger and Wang [5] implies that 

hm ^.+ (a,||-||)i/"=limsupf.-(a)i/". 
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which in particular implies that the value g{a) is independent of the choice of norm || • j|. Finally, 
note that if p (^A^'' ■ ■ ■ A\"^^ = ^?n (<^) for some n, then gi^„j(a) > p{{Ai^ ■ ■ ■ Ai^)"^) ~ g:^{a)"^ for 
each m > 1, and hence the limit superior above is also a supremum. □ 

We may immediately deduce the following corollary, which was originally noted by C. Heil and 
G. Strang (22! : 

Lemma 3.2. The function g: [0, 1] — > M is continuous. 

Proof. The first of the two identities given in Lemma 13.11 shows that g is equal to the pointwise 
infimum of a family of continuous functions, and hence is upper semi-continuous. The second 
identity shows that g also equals the pointwise supremum of a family of continuous functions, and 
hence is lower semi-continuous. □ 

Lemma 3.3. For each a G (0,1] there exists a matrix norm |1 • |1q such that A["^ < g{a) 

a 

for j 0, 1. The matrix norms \\ ■ \\a may be chosen so that the following additional property is 
satisfied: for every compact set K C (0, 1] there exists a constant Mk > 1 such that Mf^ II^IU — 
lll-BII < AfA'||B||„ for all B e M2(M) and all ae K. 

Proof. Let B {Bi, . . . , Br} he any finite set oidxd real matrices and let g{B) be its joint spectral 
radius. We say that B is irreducible if the only linear subspaces V C M.'^ such that BiV C V for 
every i are {0} and R''. A classic theorem of N. E. Barabanov |lj shows that if B is irreducible 
then there exists a constant Mb > 1 such that for each n > 1, 

miix{\\B,,...B,J: ij G {l,...,r}} < MegiB)". 

Note in particular that necessarily g{B) > 0. It is then straightforward to see that if we define for 
each V e 

\\v\\b := sup {e(B)-"max|||S,, will : e {!,..., r}}, 

n>0 

where ||| • ||| denotes the Euclidean norm, then || • ||b is a norm on R'* which satisfies ||i?iti||B < 
e(B)||f||B for every i € {!,..., r} and v e R''. It follows that the operator norm on M2(R) 
induced by || • ||b has the property ||i?i||B < for each Bi. More recent results due to F. Wirth 
[JS, Thm. 4.1] and V. S. Kozyakin [37 show that the constants Mb may be chosen so as to depend 
continuously on the set of matrices B, subject to the condition that the perturbed matrix families 
also do not have invariant subspaces. It is easily shown that is irreducible for every a S (0, 1] 
and so the lemma follows from these general results. □ 

We immediately obtain the following: 

Lemma 3.4. For each a E (0, 1] we have g{a) > 1. 

Proof. Assume ^(q;) < 1 for some a G (0,1]. Then we have sup{m(") (0") |U: ?^ > 1} < 1 by 
Lemma [331 and consequently sup{|||Ao||| : ti > 1} < oo. Since Aq = (qi), w2 have lim„^oo ll^o III — 
+00 and therefore we must have g{a) > 1. □ 



Fix some norm || • \\a which satisfies the conditions of Lemma 13.31 The following key result is 
a variation on part of [351 Thm 2.2]. We include a proof here for the sake of completeness. 

Lemma 3.5. For each a E (0, 1] define 

oo 

Za -.^ f] {x E^: A^'''>{x,n) =e(a)"}. 

n=l " 

Then each is compact and nonempty, and satisfies TZ^ Q Z^- 
Proof. Fix a E (0,1] and define for each n > 1 

Z„,„ |x e E : W^"'^ (x, n) = e(a)" | . 
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Clearly each Z^ ^ is closed. If some Z^^n were to be empty, then by Lemma 13.31 we would 
have sup { n) : a: G S} < p(a)", contradicting Lemma [3.11 For each n > 1 we have 

Za.n+l Q Za^n, SiuCC if X G Za,n+1 then 



g{a) 



n+l _ 



{x, n)\\ < g{a) 



n+l 



using Lemma 13.31 and it follows that x £ Z^.n also. We deduce that the set ~ nn=i ^q,™ is 
nonempty. Since each Za^n is closed, Za is closed and hence is compact. Finally, if x G Z^.n+i 
then we also have 



g{a) 



n+l 



< 



A'^°'HTx,n) A''°'\x,l) 



a 

SO that Tx G „, and we deduce from this that TZ^ C Za. 



□ 



The remaining lemmas in this section will be applied in the proof of Theorem l2.3l to characterise 
the extremal orbits of . 

Lemma 3.6. Let a G (0, 1] and x G S. If x is recurrent and strongly extremal for Aq,, then 
X G Za . 



Proof. Let a G (0,1] and x G S \ Zq, and suppose that x is recurrent. We shall show that 
liminf„_>.oo ^?(c^) " = and therefore x is not strongly extremal, which proves the 

lemma. Since x ^ Za, there exist e > and uq > 1 such that ||.4^"-'(x,no)||^ < (1 — e)g{a)^". 
Since x is recurrent, it follows that for each fc > 1 we may find integers > rk-i > . . . > r2 > 
ri = such that ||yl*^"''(T'''a;, no)||^ < (1 — e)g{ay^° for each i. By increasing k and passing to 
a subsequence if necessary, it is clear that we may assume additionally that r^+i > r.; + rig for 
1 < i < k. Define also rk+i :— rk + uq + 1. We have 



a ll a 

< (1 - e)^-f?(ar'=+S 
and since k may be taken arbitrarily large we conclude that 



Ti+no 



X, Ti+i - ri 



no) 



liminf A'^°'\x,n) 



-0, 



as desired. 



□ 



The following lemma is a straightforward corollary of a more general result due to S. J. Schreiber 
[46l Lemma 1]: 

Lemma 3.7. Let (/„) be a sequence of continuous functions from S to M such that fn+m{x) < 
fn{T"^x) + fm{x) for all X £ Y] and n,m > 1. Then for each x G S and m > 1, 

1 ""^ 1 
liminf fyn{T^x) > liminf -/„(x). 

n-i-oo nm ^ — ' n->oo n 

Lemma 3.8. Let a G (0,1] and suppose that the restriction of T to Za is uniquely ergodic, with 
fi being its unique T-invariant Borel probability measure. Then r :— /i ({a; G S: xi = 1}) is the 
unique optimal 1-ratio o/Aq,, and if x G T, is weakly extremal, then 

n— 1 

lim — \^ dist {T^x, supp /i) = 0. 

k=0 
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Proof. Let Ai denote the set of all Borel probability measures on E equipped with the weak-* 
topology, which is defined to be the smallest topology such that /i i— > J f dfi is continuous for every 
continuous function / : E — M. This topology makes A4 a compact metrisable space [41] Thm. 
II. 6. 4]. Let us fix a e (0, 1] and suppose that a; e E is weakly extremal. For each n > 1 define 
/i„ i^/n) X]fc=o ^T^'a; £ whcrc & A4 denotes the Dirac probability measure concentrated 



at z e E. We claim that lim„ 



— ^ in the weak-* topology. 



Applying Lemma [5771 with /„(a;) :— log ||^^") 
all X and n, we obtain 



and noting that fn{x) < nlogg{a) for 



(3.1) 



lim 



/ - log ^(") (z, N) dfi„{z) = lim — V log ^(") {T'x, N) = log g{a) 

J N a n-J-oo nJy ^ ^ ' a 



forevery iV > 1. As in the proof of Lemma[33]we let Z^m {z G E: (z, iV)||a = for 

each iV > 1, and we recall that Za^N+\ Q Za,N for every N . Let v € M.he any limit point of the 
sequence (/x„). If / : E — >■ R is any continuous function then it follows easily from the definition 
of that 1/ f dv — J f oT dv\ < limsup„^g^ I / / ° Td^n — J f d^n\ — and it follows that v 
is T- invariant. For each iV > 1 we have 



log 



^(")(z,Ar) diy{z) = log g{a), 



and since ||.4'^"^(z, A^)||^ < g{a)^ for all z e E it follows from this that i'{Za,N) = 1- Since this 
applies for every N, and Za,N+i Q Za,N for every N, we deduce that i^iZa) = 1. By hypothesis /x 
is the unique T-invariant element of A4 giving full measure to Zq,, and it follows that v = fi. We 
have shown that fi is the only weak-* accumulation point of the sequence and since A4 is 

compact and metrisable we deduce that lim„_j.oo fJ-n — fJ-, which completes the proof of the claim. 

The proof of the lemma now follows easily. Let / : E R be the characteristic function of the 
set {x e E: xi = 1}, and note that / is continuous since this set is both open and closed. Define 
a further continuous function by g{x) := dist(a;, supp /i). Since /i„ — > /i we may easily derive 



lim <;(7r„(x)) 



lim — 



n-1 

E 

i=0 



lim / /(i/Li„ 



and 



lim — 

n— fC30 fi 



n-1 
1=0 



dist(T'a;, supp /i) = lim 

' n— >-oc 
I 

as required. The proof is complete. 



f djj, — ^jl{{x e E : xi = 1}) = r 



d/iji = / g dfi = 



□ 



4. General properties of balanced words 

In this short and mostly expository section we present some combinatorial properties of balanced 
words which will be applied in subsequent sections. We first require some additional definitions. 

Given two nonempty finite words u, v of equal length, we write u < v ii u strictly precedes v 
in the lexicographical order: that is, m < w if and only if there is fc > 1 such that Uk — 0, Vk — 1, 
and Ui = Vi when 1 < i < k. We define the reverse of a finite word u, which we denote by u, to 
be the word obtained by listing the terms of u in reverse order. That is, if it = uiU2 ■ ■ ■ Un then 
u = UnUn-i ■ ■ ■ ui. Wc Say that a finite word p is a palindrome ii p — p. Since the reverse of the 
empty word is also the empty word, the empty word is a palindrome. We say that two finite words 
u and V of equal length are cyclic permutations of each other, and write m ~ if there exist finite 
words a and b such that u = ah and v = ha. For each n > this defines an equivalence relation 
on the set of words of length n. 

We begin by collecting together some standard results from [35] : 

Lemma 4.1. Let 7 e (0,1) and x G X^, and choose any N > max{[7~^], [(1 — 7)^"'^]}. Then 
neither 0^ nor 1^ is a subword of x. 



COUNTEREXAMPLE TO THE FINITENESS CONJECTURE 



9 



Proof. Let u ^ X with |u| = iV. By [351 Prop. 2.1.10] we have 7|u| + 1 > |u|i > 7|u| - 1. 
In particular we have > 7[7^^] — 1 > and \u\ — \u\i > (1 — 7)[(1 — 7)~^1 — 1 > 0, so 
< < |u| and u cannot be equal to 0^ or 1^. □ 

Definition 4.2. Let W C ft x il he the smallest set with the following two properties: (0, 1) S W; 
if e W, then {uv,v) £ W and {u,vu) G W. We say that u G is a standard word if either 

{v, w) G W or {u, v) E W for some v E il. 

Lemma 4.3. The set of standard words has the following properties: 
(i) If u is standard, with \u\ = q and \u\i = p, then u°° G Xp/g. 

(ii) For every 7 G [0, 1] there exists x G X-^ such that for infinitely many q G N the word 7rq{x) 
is standard. 

Proof, (i). li q = 1 then the result is trivial. For g > 1, [351 Prop. 2.2.15] shows that every 
standard word is balanced. If u is standard, then it is clear from the definition that u" is a 
subword of a standard word for every n > 1. In particular every u" is balanced and therefore u°° 
is balanced. 

(ii). Let X be the infinite word defined by Xn ■= [lin + 2)J — [7(71 + 1)J G {0, 1} for all n> 1. 
This word is called the characteristic word for 7. It is shown in |35[ Prop 2.2.15] that x has the 
required properties. □ 

The following result is given in the proof of [351 Prop. 2.1.3]. Note that p may be the empty 
word; for example, this is true in the case w — 0011. 

Lemma 4.4. Let w be a finite word which is not balanced, let u and v be subwords of w of equal 
length such that |it|i > 2 + and suppose that u, v have the minimum possible length for which 
this property may be satisfied. Then there is a palindrome p such that u — Ipl and v = OpO. 

The following two results arise in the fourth named author's PhD thesis 14 7 j : 

Lemma 4.5. Let w be a finite word and p a palindrome, and suppose that OpO and Ipl are 
subwords of w. Then there is a finite word b, which may be empty, such that either OpOblpl or 
IplbOpO is a subword of w. 

Proof. Recall that u ^ v means that m is a subword of v. Since OpO and Ipl are both subwords 
of w, the only alternative is that they occur in an overlapping manner: that is, there are finite 
words d, e, / such that OdleO/l -< w, where die — eOf — p, or similarly with and 1 interchanged. 
Since p = p, the relation die = eOf = p implies eld = eO/, and since |e| = |e| we obtain 1 = 0, a 
contradiction. We conclude that the words OpO and Ipl cannot overlap, and the result follows. □ 

Lemma 4.6. Let u be a finite word which is not balanced. Then there exist words a, w, b such 
that awb ~< u and one of the following two possibilities holds: either b > a and w > w, or a > b 
and w > w. 

Proof. Combining Lemmas 14.41 and 14.51 we find that there exist words p, v such that p — p and 
either OpOvlpl -< u, or IplvOpO -< u. In the former case we may take a := Op, b := pi and 
w := Owl, and in the latter case we may take a := Ip, b := pO and w := IvO. □ 

Finally, we require the following lemma which characterises those finite words for which all 
cyclic permutations are balanced. This result appears to be something of a "folklore theorem" in 
the theory of balanced words; to the best of our knowledge, the proof which we present here is 
original. A version of this result appears as [3] Thm 6.9]. Note that the word u := 1001 is an 
example of a balanced word with the property that u°° is not balanced. 

Lemma 4.7. Let u be a nonempty finite word. Then the following are equivalent: 

(i) Every cyclic permutation of u is balanced, 
(ii) The finite word v} is balanced. 
(Hi) The infinite word u°° is balanced. 
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Proof. It is clear that (iii) (ii) =^ (i). To prove the imphcation (i) (ii) by we shah show 
that if u is a nonempty finite word such that is not balanced, then there is a cyclic permutation 
of u which is not balanced. 

Let us then suppose that u is a finite nonempty word such that is not balanced. Let a, b 
be subwords of of equal length such that ||a|i — |6|i| > 2, and suppose that no pair of shorter 
subwords may be found which also has this property. Clearly we have ||a|i — |6|i| = 2, and without 
loss of generality we shall assume that |a|i = 2 + By Lemma [4.41 there exists a palindrome 
p such that a = Ipl and b = OpO, and it follows from Lemma [4.51 that \a\, \b\ < \u\. We may 
therefore choose words c and d such that |c| — \d\ — \u\ — \a\ — \u\ — \b\ and ac bd ~ u. Since 
\ac\i — \bd\i = \u\i we have = 2 + |c|i, and since a and b are the shortest words with this 
property we must have |6| = \a\ < \c\. Now, since ac ~ u, it is not difficult to see that every word 
which is a subword of some cyclic permutation of u and has length at most |c| must occur as a 
subword of the word cac. In particular b -< cac, and since |6| = \a\ we have either b ^ ca or b ^ ac. 
In either case we have shown that there exists a cyclic permutation of u which has both a and b 
as subwords, and no word with that property may be balanced. We conclude that (i) cannot not 
hold when (ii) does not hold, and so (i) =4> (ii) as required. 

It is now straightforward to show that (ii) =4> (iii). Let u be a finite nonempty word such that 
is balanced; then every cyclic permutation of u is balanced, since the cyclic permutations of u 
are precisely the subwords of with length \u\. Now, the cyclic permutations of are precisely 
the words of the form v'^ where w ~ u; but since (i) =^ (ii), all of these cyclic permutations 
must be balanced also. Applying the implication (i) (ii) again we deduce that is balanced. 
Repeating this procedure inductively shows that m^*" is balanced for every A: > 1, and this yields 
(iii). □ 



5. Relationships between balanced words and extremal orbits 

The principal goal of this section is to show that for each a G (0, 1], every recurrent x G T, 
which is strongly extremal for Aq, is balanced. We also prove some related ancillary results which 
will be applied in the following section. 

The following valuable lemma shows that under quite mild conditions the trace, spectral radius, 
Euclidean norm and smallest diagonal element of a matrix of the form A{u) approximate each 
other quite closely. For every B G M2(M) we define <){B) to be the minimum modulus of the 
diagonal entries of B. 

Lemma 5.1. Let a e [0, 1] and N > 2, and let u be a nonempty finite word such that 0^, 1^ 7^ u. 
Then, 

1 



2JV2 



Proof. Let m{B) denote the maximum of the entries of a non- negative matrix B e M2(R). The 
inequalities 



Ills III - VpiB*B) < ^iv{B*B) < 2m{B) 

and 

HB)<hTB<piB)<l\B\l 

are elementary. To prove the lemma, it therefore suffices to show that m (^'■"' (u)) < N^d{A''"^ (u)) 
whenever 0^, 1^ 7^ u. Since ^("^(u) = al"li.4(u) it is clearly sufficient to consider only the case 
a = 1. 

Let us prove this inequality. We shall suppose that the final symbol occurring in w is 0, since 
the opposite case is easily dealt with by symmetry. Let n > 1 and ai , . . . , a„ > 1 be integers such 
that either u = O^'-r^-^O""-^ ■ ■ ■ l^^o^i with n odd, or u = i^r^Q'^"-! . . . ia20'^i with n even. 
By hypothesis we have ak < N — 1 for every k. 
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For 1 < fc < n let us define 



ai 



1 

aa + ■ ■ ■ + 



1 

ak-i H 



in least terms, and define also Poi^-i ■— and P-i,qo '■= 1. The integers Pk,Qk then satisfy the 
recurrence relations pk = a^Pk-i + Pk-2 and qk — a^qk-i + qk-2 for all k in the range 1 < k < n. 
A well-known formula for pk j qk implies 



if n is odd, and 

A{u)^A\-Al-' ---Al^ = 



I an \ I 1 0\ I I ai \ _ I Pn qn 

1 ; an-l I 1 y V P^-^ 9n-l 

1 \ / 1 a„_i \ / 1 ai \ ^ / p„_i qa-i 

a„ 1 M 1 / ■ " I I \ Pn qn 



if n is even (see, e.g., [I^). If n is odd then clearly t){A{u)) ~ min{p„, and since q„ = 
a„g„„i + g„_2 < + l)gri-i < Nqn^i and p„/g„ > l/(ai + 1) > l/A^" we obtain m{A{u)) = 
qn < min{A''p„, A^(7„_i} < N'^'i3{A{u)) as required. If n is even then similarly m{A{u)) = qn < 
A^g«-i < N^Pn-i = A^^O(yl(u)). The proof is complete. □ 

Let a,w,b be nonempty finite words with \a\ — \b\. We shall say that (a, w, 6) is a suboptimal 
triple if either d > b and > ?«, or 6 > a and w > w. We require the following lemma due to V. 
Blondel, J. Theys and A. Vladimirov |6i Lemma 4.2]: 

Lemma 5.2. Let w be a nonempty finite word. Then A{w) — A{w) — k{w)J , where k{w) G Z 
and 

' 1 
-1 

Moreover, k{'w) is positive if and only if w > w, and negative if and only if w < w. 



J -.^ A^Ai - AiAo 



The following is a slightly strengthened version of [6, Lemma 4.3]: 

Lemma 5.3. Let (a,w,b) be a suboptimal triple, let _Bi,i?2 be non-negative matrices, and let 
a e [0, 1] . Then 

tr (^BiA^°'\awb)B2^ > tr (^BiA^°'\awb)B2^ + a'"™''!^ t)(Bi)t)(B2). 

Proof. Since tTA^°''^{u) — a^^^'-ti A{u) for every finite word u it is clearly sufhcient to treat only 
the case a = 1. We shall deal first with the case where a > b and w > w, the alternative case 
being similar. Since a > 6 we may write a = ulc, b — cOv for some finite words c, u and v (which 
may be empty). Note that J satisfies the relations 

AiJAi = AqJAo = J, AqJAi = -1 ) ' ^i-^^o =(^10 

and hence by Lemma 15. 2i 

tr {A{a){A{w) - A{w))A{b)) = fc(w)tr ^{u) ( } J ) -^C'^)) > 1- 

Now, a direct calculation shows that for any non- negative matrix C S M2(R) we have tr (BxCB^) > 
D(i?i)Z)(i?2)tr (C). Since the matrix A{a){A{w) — A{vS))A{f)) is non-negative, we deduce that 

tr {BxA{awb)B-2) - tr {BxA{awb)B-2) = tr {BxA{a){A{w) - A{w))A{b)B2) 

> 0(Si)0(B2)tr {A{a){Aiw) - Aiw))A{b)) 

>d{Bi)d{B2) 
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as required. In tlie case where b > a and li > w, the integer k{w) and the matrix AqJAi each 
contribute a negative sign to the product A{a)(A(w) — A{w))A{b)) and the same conclusion may 
be reached. □ 

We may now prove the following two results which will allow us to characterise extremal orbits 
in terms of balanced words: 

Lemma 5.4. Let < - < 1, with the integers p and q not necessarily coprime. Suppose that 
\u\ — q, \u\i — p and 

(5.1) p{A{u)) = max{p{A{v)) : \v\ = q and \v\i = p} . 

Then the infinite word u°° is balanced. 

Proof. We shall begin by showing that if u has the properties described then it is balanced. Let 
us assume for a contradiction that u has these properties but is not balanced. By Lemma 14. 6[ 
there exists a suboptimal triple (a, w, b) such that awb -< u. Let us write u — siawbs2 and define 
u := Siawbs2. By Lemma 15.31 we have tr (^(m)) > tr (A{u)). Since A{u) and A{u) are both 
non-negative matrices with unit determinant, it follows that 

p(-4(u)) = I (tr {A{u)) + v/tr(^(u))2-4) > 1 (tr {A{u)) + ^tr {A{uW - 
= p{A{u)). 

Since clearly |u| = \u\ and = this is a contradiction, so u must be balanced as required. 

Now, suppose that u satisfies (|5.ip with |u|i = p and \u\ = q, and that w is a cyclic permutation 
of u. It is a well-known property of the spectral radius that p(i?ii?2) — p(-B2-Bi) for any Bi, B2 € 
M2(M), and it follows from this that p(A{v)) = p{A{u)). By applying the preceding argument 
to V it follows that v is also balanced. We conclude that all of the cyclic permutations of u are 
balanced, and by Lemma l477l this implies that u°° is balanced as required. □ 

Proposition 5.5. Let a e (0, 1] and suppose that x £ Za- Then x is balanced. 

Proof. To prove the proposition, let us suppose that there exists a recurrent infinite word x £ Zq, 
which is not balanced. We shall then be able to deduce a contradiction, and the result follows. 
The general principle of the proof is that if x is recurrent and not balanced, then we can construct 
a word based on x along which the trace of the product A'^^^x^n) grows "too rapidly". 

Fix a real number > 1 such that C-^\\B\\a < \\B\\ < Ca\\B\\a for all B e M2(M). By 
Lemma [3.41 we have g{a) > 1, and by Gelfand's formula we have (0")L 
It follows in particular that there is an integer A^o ^ 2 such that ||.4^"^ (O^")]! 
therefore 0^° 7^ z for every z E Za. Similarly we may choose iVi > 2 such that 1^^ 7^ z for every 
z G Za. Let N := maxjA^o, ^i}, and choose a further integer M >2 such that 



— > 1 as n 00. 
< g{a)'^° and 



^ a a J 



< 



If 1; is any subword of x, then there exists n > such that yl'"^(w) — ^*^")(T"a;, |w|), and since 
T"a; e Za this implies 



(5.2) 



> 



1 



27V2 



A^^^v) 



> 



1 



2CaAf2 



(T"a;,|D|) 



2CaiV2 ' 



where we have used Lemma |5. II On the other hand, for any nonempty finite word m, 



(5.3) 



tvA^'^Hu) < 2p (^A^"\u)'^ < 2 A^'^Hu) < 2g{a) 



Now, since x is not balanced, it by definition has a subword which is not balanced. Applying 
Lemma 15.31 to this subword we deduce that there exists a suboptimal triple {a,w,b) such that 
awb -< X. Define £ := |aw6|, and fix an integer K > 1 such that 

K 



1 



a 



> 2CaN^ 
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Since x is recurrent there are infinitely many occurrences of tlie word awb as a subword of x, and 
so we may choose words si, . . . , sk+i such that the word 



,(0) •= 



si{awb)s2(awb)s^ SK{awb)sK+i 



is a subword of x. Let L := lu^*^^!, and for i = 1,. . . ,K define a new word m*-*' by reversing the 
first i explicit instances of the word w in m'-^'; that is, 



,(1) 



= Si{awh)s2{awb)sz SK{awb)s 



K+l, 



,(2) 



Si{awb)s2{awb)s^ SK{0'Wb)sK+i 



and so forth, up to 



,W - 



si{awb)s2{a'wb)s3 SK{awb)sK+i- 



Note that for each i we have, by applying Lemma 15.31 i times and using 



(5.4) 



> 



since u*^"^ is a subword of x. As a consequence we observe that 0*^ 7^ u*^') for every «, since if we 
were to have 0*^ -< u^*-' for some i then we could obtain 



< 



(0 



.g{a) 



< 



g(«)^ 



a contradiction. Clearly an analogous contradiction would arise if we were to have 1*^ -< u^'^ and 
we conclude that 1^^ 7^ u'-'^ also. 

Now, for i = 1,...,K let c*^*\ d'^*-' be those words such that w*^*^^^ = c'^^^awbS''^ and = 
c^'^^ awbd''^\ Note that |c'^'-'| + + £ = L for each i. Making i applications of Lemma [5.31 and 
using (15. 2p yields 



tryi(")(c(')) = try^f") (si(aw&)s2 . . . s,^i{awb)s^) 

> tr^'"^ {si{awb)s2 ■ ■ ■ Si^i{awb)si) > 



^?(a)l' 



2C„iV2 ' 



since the last of these words is a subword of u^^\ and m*^"^ is a subword of x. Since c^*^ -< w^*^ 
and 0^^, 1^^ / u*^*^ we have 0^^, 1^^ 7^ c^'\ and by Lemma [5. II in combination with the preceding 
inequality this implies 



(5.5) 



Equally, since d^*-' -< u^"-* and m^^-' is a subword of x, we may apply (|5.2p to obtain 

We may now complete the proof. Combining (j5.5p . (|5.6p . and (|5.3p we obtain for each i 



and hence by Lemma 15. 3[ 



a 



8CINHP - 16ClN'^APg{a) 
try^(")(u(*-i)). 



tryt(")(u('-i)). 



14 



KEVIN G. HARE, IAN D. MORRIS, NIKITA SIDOROV, AND JACQUES THEYS 



In combination with (|5.3|) and (|5.4[) this yields 

contradicting cm- choice of K. The proof is complete. □ 

6. Study of the growth of matrix products along balanced words 

In this section we analyse in detail the exponential growth rate oi A{x, n) in the limit as n — )■ oo 
for X G X^, investigating in particular the manner in which this value depends on 7. A construction 
with similar properties is discussed briefly in [S] §4.3]. The results of this section are summarised 
in the following proposition: 

Proposition 6.1. • There exists a continuous concave function S: [0,1] R such that 

for each 7 e [0, 1], 

lim -log|||yl(a;, n)||| = lim - log p{A{x,n)) ^ S{^) 

n— voo 77, n— >-oo ri 

uniformly for x G . 

• Ifj =p/q e [0, 1] n Q then 5(7) = q^^ log p{A{x,q)) for every x e X-^. 

• The function S also satisfies inf..y£[o,i] S = S{0) = S{1) = 0, sup 5 — 5(1/2) — log£i(l), 
and 5(7) = 5(1 - 7) for all 7 € [0, 1]. 

• The function S is non- decreasing on [O, ^] . 

The proof of Proposition 16.11 is given in the form of a sequence of lemmas. Specifically, the 
result follows by combining Lemmas I6.2H6.41 and Lemma 16.61 below. 

Lemma 6.2. Let 7 G [0, 1]. Then there exists a real number 5(7) such that 
lim — log |||^(x, n)||| = lim — logp(^(a;, n)) = 5(7) 

n— ^00 fi n— foo 11 

uniformly over x € X-y . 

Proof. In the cases 7 = 0, 7 = 1 the lemma is trivial, since by Theorem 12. II the set X~^ consists of 
a single point which is fixed under T, and the result follows by Gelfand's formula. To prove the 
lemma in the nontrivial cases we use a result due to A. Furman |17| on uniform convergence for 
linear cocycles over homeomorphisms. Since in general the transformations T: Xj — >■ Xj are not 
homeomorphisms, this is achieved via an auxiliary construction. 

Let us fix 7 g (0, 1). Define a space of two-sided sequences Xj C {0, 1}^ as follows: the 
sequence x = {xn)nez € {0, 1}^ belongs to X^ if and only if there exists 5 € [0, 1] such that either 
Xn = \{n + 1)7 + (5] — [n7 + S~\ for all n e Z, or a;„ = [(n + 1)7 + (5J — [nj + S\ for all n € Z. 

It follows from the discussion subsequent to the statement of Theorem 12.11 that the two-sided 
sequence (a;i)igz belongs to X^ if and only if the one-sided sequence {xi+k)i^i belongs to X^ for 
every fc G Z. We equip X-y with the topology it inherits from the infinite product topology on 
{0, 1}^, and define T: X^ ^ X^ by ^[(a;,),^^] := (a;i+i)igz analogously to the definition of T. In 
the same manner as for the transformation T: X,y — > X^, one may show that T: X^ — > X-y is a 
continuous, uniquely ergodic transformation of a compact metrisable space. 

Finally, we define A: X-y x Z — > M2(M) in the following manner: given x — {xi)i^z G X-y and 
n > 1, we define Aix,n) := •••A^,, Aix,~n) := ' ' ' = ^(^-"a;, n)-i, 

and A{x, 0) — I. It may be directly verified that A is continuous and satisfies the following cocycle 
relation: for all x £ X-y and n,m ^ "Z, we have A{x, n + m) = A{T^^x, m)A{x, n). 

Now let > 1 be as given by Lemma [4.11 For each x G X-y we have 0^, 1^ 7^ x. Since for 
each X = {xiji^z G X.y we have {xij'^i € A-y, it follows from this that the matrix product which 
defines A{x,N) is a product of mixed powers of Aq and Ai, and does not simply equal Aq or 
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. A simple calculation shows that this implies that for each x G X^, all of the entries of the 
matrix A{x,N) are strictly positive. We may therefore apply [T71 Theorem 3] to deduce that there 
exists a real number £'(7) such that ^ log ||yi(a;, n)\\ converges uniformly to S{'-f) for x S X-y. Since 
clearly for each n> 1, 

^A{x, n): X €z X-y^ — {A{x, n): x €z X^} , 

this implies that ^ log ||yl(a;, n)|| converges uniformly to S{j) for x G X^. Since as previously 
noted we have 0^,1^ 7^ x for all x £ X^, it follows immediately from Lemma l5.ll that also 
^ logp(^(a;, n)) — > <S'(7) uniformly over x € X~^. The proof is complete. □ 

Lemma 6.3. The function S has the following properties: 
(i) Let 7 = p/q £ [0, 1], not necessarily in least terms: then S{^) — q^^ log p{A{x, q)) for every 

X e Xp/g . 

(ii) Let u be a finite word such that \u\ = q, \u\i = p. Then S{p/q) > q~^ log p{A{u)) . 

(iii) Let 7 e [0,1] he irrational. Then there exist x € Xj and a sequence of rational numbers 
(Pn/'ZrOn^i Converging to 7 such that S{pn/qn) = qn^ ^og p{A{x,qn)) for every n > 1. 

(iv) For every 7 € [0, 1] we have S{'y) — 5(1 — 7). 



Proof, (i). By Theorem 12. II we have T'^x = x for every x £ ^p/q: and so for every x € ^p/q, 

S{p/q)^ lim -^-logP(x,fcg)||| = lim log ||| = ^ log p (^(x, g)) . 

n->oo kq fc->oo kq q 

(ii). Clearly the set of all words v such that — q and |f|i = p is finite, so there exists 
a word v which attains the maximum value of p{A{v)) within this set. In particular we have 
p{A{v)) > p{A{u)). By Lemma [5.41 the infinite word v°° G E is balanced, and since it is clearly 
recurrent we have v°° S ^p/q by Theorem 12.11 By part (i) this implies q"^ logp{A{v)) = S[p/q) 
as required. 

(in) Let x S X^ be as given by Lemma[13{u), and let {qn)'^=i be a strictly increasing sequence 
of natural numbers such that ■Kq^[x) is a standard word for every n. Define p„ :— \T^q^{x)\i 
for each n > 1. By the definition of X^ we have Pn/qn ^ 7- Since each 'nq^{x) is standard, 
['Kq^{x)\°° e Xp^^/q^^ for each n by Lemma l4.3( i). and by part (i) of the present lemma this implies 

Sipn/qn) = qn^^Ogp{A{x,qn)). 

(iv) For each finite or infinite word uj, define uJ to be the mirror image of w, i.e., the unique 
word such that cUi = 1 if and only if uji — 0. It is clear that x € X-y if and only if x G Define 
i? = ( 1 ) and note that R^^AqR = Ai and R^^AiR ^ Aq. li x e Xry and n > 1, then 

R^^Aix, n)R ^ (R-^A^^R) ■ ■ ■ {R-^A^^R){R-^A^,R) = A{x, n) 

and in particular p{A{x, n)) — p{A{x, n)). It follows easily that 5(7) = S{1 — 7). □ 

Lemma 6.4. The function S satisfies 5(0) = inf 5 = and 5(^) = sup 5 = logf3(l). 

Proof. The reader may easily verify that 

(6.1) i^ii = IIAoll = Po^ilP = ^A^Aor = p{AoA^y/^ = i±^. 

By Theorem 12. 1[ we have Xi^2 = {(01)°°, (10)°°}, so by Gelfand's formula we have 

lim \lA{x,n)f^" = p{AoA,)i = p{A,Ao)-^ = ^^-^ 
when X G ^1/2- Let us show that g{l) ~ ^^^^ . In other words, we will prove that 



sup(p(a;,n)f/": xG S| = lim P((01)° 



II V" 



l + %/5 



2 

Suppose X has a tail different from (01)°°. Then it must contain one of the following subwords: 
wi = 11(01)"1, W2 = ll(01)"00,u;3 = 00(10)"0,w;4 = 00(10)"11 with n > 0. In view of mirror 
symmetry, it suffices to deal with wi and W2. We will show that it is possible to replace them 
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with subwords of (01)°°, w'l and W2 respectively, in such a way that the corresponding growth 
exponent does not decrease. 

Namely, put w[ = (10)"+^1 and W2 — (10)"+^. It is easy to see that for n > 1, 



(AiAo)" = 



F2n~2 F2n~l 
F2n~l F2n 



F2n+1 


F2n-1 


fin+3 


F2n+1 


F2n+2 


F2n+1 


F2n+3 


F2n+2 



where, as above, (i^n)5^Lo is the Fibonacci sequence (with Fq = Fi = 1). Hence 

Al{AoA,)"Ai 

whereas 

{A^Aor+'A, 

i.e., A{'w'i) dominates A{'Wi) entry- by-entry. Similarly, 

AiiAoAirAl 

and 



F2n F2n+2 
F2n+2 F2„+4 



(A A \n+2 ( F2n+2 -^2^+3 
\i^2n+3 i^2n+'iy 

Thus, q{1) — — e^^'^\ and since clearly S{'^) < logp(l) for every 7 e [0, 1] this implies that 

sup 5* — S{\/2). On the other hand, it is clear that Xq contains a single point x corresponding to 
an infinite sequence of zeroes, and for this x we have S{0) = logp(v4o) = 0. Finally, since every 
matrix A{x^ n) is an integer matrix which has determinant one and is hence nonzero, every x d T, 
has i log |||^(a;, n)l\ > for all n and therefore £'(7) > for every 7. □ 

Lemma 6.5. The restriction of S to (0, 1) H Q is concave in the following sense: i/ 71,72, A G 
(0, 1) n Q then S{Xji + (1 - A)72) > XSiji) + (1 - X)S{-f2)- 

Proof. For i = 1,2 let 7^ = Pi/qi in least terms, and let A — k/m. Let M — max{(7i,g2}- As a 
consequence of Lcmma l^^T i) there exist finite words u*-^^ G such that lu*-*-* |i — Pi, lu'-*-' | = Qi 
and S{'-fi) = q^^ log A{u^^'') for each i. 

Since < 71,72 < 1 we have < \pi\ < \qi\ and therefore 0^^,1^^ / (m^'^)'' for i = 1,2 and 
every £> 1. In particular, for each ii,i2 > 1 the word (u'-^'^Y^ {u'^^^Y^ does not have 0^*^ or 1^*^ 
as a subword, and hence by Lemma 15.11 



p{A{u^'^YHu^'Y')>^{^i^^'''y'(^^'^y') >f(-4(wW)^03(-4((^.(2)/2) 

Applying this inequality together with Lemma 16.3^ 11) . for each n > 1 we obtain 

/ kpiq2 + {m- k)qiP2^ 



^(A7i + (1 - A)72) = S 
> 



V mqiq2 
i log p{A{{u'^^^ r'"^' (y(2)yi(m-fc)9i 

nmqiq2 



> 

nmqiq2 



flog p(yi((w(l) ))+ log p(yt((u(2))"('"-'=)9i)) - log 64]^^) 



logp(^(M(l))) + logp(^(M(2))) _ _^ 



niqi mq2 nmqiq2 

log 6471^* 

- A5(7i) + (1 - A)5(72) - ^ . 

nmqiq2 

Taking the limit as n — >■ 00 we obtain the desired result. □ 
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Lemma 6.6. The function S : [0, 1] — !■ R is continuous and concave. 

Proof. By Lemma the restriction of S to (0, 1) n Q is concave. Define a function S : [0. 1] 
by 

S{j) := hm sup {S{-f^): 7* € (0, 1) n Q and I7* - 7] < e} . 



Note that S is well-defined since S is bounded by Lemma 16.41 We shall show in several stages 
that S is continuous, concave, and equal to S throughout [0, 1]. 

We first shall show that S is concave. Let 71, 72, A e [0, 1], and choose sequences of rationals 
^7^"^^, (72"^) ^^"^ i^n) belonging to (0,1), converging respectively to 71,72 and A, such that 

lim„_yoo S (li"^^ — Siji) for i = 1,2. We then have 

5(A7i + (1 - A) 72) > limsup^ (a„7["' + (1 - A„)7("^) 

> limsup XnS (7["') + (1 - A„) S (7^"^) 

= Jim A„5(7i"')+(l-A„)5(7^")) 

= A§(7i) + (1 - A) §(72) 

using Lemma 16.51 and S is concave as claimed. In particular the restriction of S to the interval 
(0, 1) is continuous (see for example @3l Thm 10.3]). 

We next claim that S{j) — S{'-f) for rational values < 7 < 1. Given 7 e (0, 1) n Q, choose a 
sequence of rationals (7,1) such that 7„ — >■ 7 and S{jn) — ^ If < 7 < 7ri for some n then 

^(7) > f 1 - -) ^(0) + -Shn) = ^5(7n), 
V In J In In 

and similarly if 7„ < 7 < 1 then 

s„,(ie^)5h..,+(i^)s(i,i(i^)sh„)- 

It follows that by taking the limit as n ^ cx) we may obtain £'(7) > >S'(7), and the converse 
inequality S'(7) > 5(7) is obvious from the definition of S. This proves the claim. 

We now claim that limT,^o S^(7) = S{^) = = 5(0) and lim^,^! 5(7) = S^(l) = 5(1). 
Since 5(7) = 5(1 — 7) for every 7 g [0, 1] by Lemma [131 iv) it is sufficient to prove only the first 
assertion. By Lemma 16.41 we have 5(0) — inf 5 = and therefore inf 5 > 0. Since 5 is concave 
there must exist (5 > such that the restriction of 5 to [0, S) is monotone, and so if we can show 
that lim„_j.oo 5(l/ri) = then the desired result will follow. By the preceding claim it is sufficient 
to show that lim„_yoo 5(l/n) = 0. For each n > 1 it is easily verified using Lemma 14.71 that 
(0"1)°° e Xijn-, so using Lemma IQl i) we may estimate 

< 5 f iU ^ logp(A„"AO < ^ logtr ^AlA,) - ^^S^" + 



n) n+1 '~n+l " n+1 

and therefore S{\/n) — ?> 0. This completes the proof of the claim. 

To complete the proof of the lemma it suffices to show that in fact 5(7) = 5(7) when 7 is 
irrational. Given 7 G [0, 1] \ Q, let x € and {Pn/<ln)'^=i be as given by Lemma lOT ni). Since 
5 is continuous and agrees with 5 on the rationals, we may apply parts (iii) and (i) of Lemma l6.3l 
to obtain 

S{j)= hm —\ogp{A{x,qn))= lim S ( ^) = lim S ( ^) =5(7), 

n-i-oo g„ n-i-oo \qn J n^co \qn J 

and we conclude that 5 = 5 as desired. □ 
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To conclude the proof of Proposition [^31 we note that the function S being non-decreasing on 
[O, ^] follows from its concavity and the fact that max S{j) — S{l/2). 

7. Proof of Theorem 12.31 

Before commencing the proof of Theorem 12.31 we require the following simple lemma: 

Lemma 7.1. For each a G [0,1] we have g{a) > e^^'^^a'^ for all 7 G [0,1]. If a £ (0,1] and 
X^nZa^ 0, then C Z„ and g{a) = e'^^'^^'a'^. 

Proof. In the case a = 0, an easy calculation using Proposition 16.11 and the definition of g shows 
that g{a) = p{Ao) = 1 = e'^^'^^. It is therefore clear in this case that g{a) — e^'^^'^a^ if and only if 
7 = 0. For the rest of the proof let us fix a G (0, 1] and 7 G [0, 1]. For each x G X^, we have 



log g{a) — lim sup sup 
1 



los 



z G S y > 



lim — log 

n— 5-00 Tl 



= lim 



log|||yl(a;,n)||| + (;(7r„(a;)) log a = 5" (7) + 7 log a 



so that g{a) > e^'^^^o? . If a; G X-y n then by the definition of Za we have 



S{l) 



7 log a — lim — log 



A^''\x,n) 



— lim — log 

n— >-oo Tl 



A^^'Xx^n) ^\ogg{a) 



so that g{a) — e^'^'^'^a^ , and since by Theorem 12. II the restriction of T to X-y is minimal it is clear 
that XyfZZa. □ 

We also require the following lemma, which is an easy consequence of a result in [6]: 

Lemma 7.2. Let a G [0,1] and let u,v be nonempty finite words such that p{A'--"\u)Y^^"^ = 
p{A'-"\v)y/\''\ = g{a). Then <;(u) = <;{v). 

Proof. In [6], Blondel, Theys and Vladimirov define two nonempty finite words u, v to be essentially 
equal if there exist finite words a, b such that au°° — bv°° . In particular it is clear that if u and v 
are essentially equal then necessarily <;(u) — Blondel et al. then associate to each nonempty 
finite word uj the set J„ = {a G [0, 1] : A^°''> (oj) = In |6l Lemma 4.4] it is shown that if 

Ju^Jv 7^ then u and v are essentially equal. We deduce from this that if u and v are nonempty 
finite words which satisfy p{A^"-\u)f/\'^\ = p{A^"-\v)f/\''\ = g{a) for some fixed a G [0,1], 
then a G J„ H Ji, by definition; this implies that u and v are essentially equal, and therefore 
<i{u)=q{v). □ 



Now we are ready to prove Theorem [ 

1. Existence of r. We shall begin by showing that for each a G (0,1] there exists a unique 
7 G [0,1] such that X^ O Za 7^ 0. Let a G (0,1]. By Lemma 13.51 the set Za is compact and 
invariant under T, and this implies that it contains a recurrent point (see e.g. |27l p. 130]). It 
follows by Proposition 15.51 that Za contains a recurrent balanced infinite word, and hence there 
exists 7a G [0, 1] such that X^^ n ^ 0. By Lemma FO it follows that e'^^'^^^a'*'" = g{a). We 
claim that is the unique element of [0, 1] with this property. By Lemma l7. II this further implies 
that Xj n Za = ^ when 7 ^ 7^^ . 



To prove this claim, let us suppose that < 71 < 72 < 1 with e 



= 5(72)0,72 = ^(q,)^ 

and derive a contradiction. Choose Ai, A2 G [0, 1] such that 71 :— A171 + (1 — Ai)72 and 72 :— 
A271 + (1 — A2)72 are both rational with 71 < 7i < 72 < 72- Applying Proposition 16. II we deduce 

S'(7i) + 7i loga = S{Xi-fi + (1 - Ai)72) + (Ai7i + (1 - Ai)72) log a 

> Ai(S'(7i) + 71 loga) + (1 - Ai)(S'(72) + 72 loga) = loggi(a), 

and hence e^^^^^a"^^ > g{a), for i — 1,2. Applying Lemma l7.ll it follows that e'^'^'^^-'a'^i — 

eS(72)Q,72 



g{a). Let x G X^-^ and y G Xj^, and let u :— nq-^{x) and v 



, (y). By Propo- 



sition [6T] we have g{a) — p(^"(u))^/l"l = p(^"(w))^/l''l, and since <r(u) = 71 < 72 = (;{v) this 
contradicts Lemma [7121 The claim is proved. 
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Let us define r(a) := 7^ for all a e (0, 1], and r(0) := 0. Note that g{0) = p{Aq) = 1 = e'^^^) as 
a consequence of Lemma 13.11 and Proposition 16. II It follows from this and the previous arguments 
that for all a,7 G [0, 1] we have g^a) > e^'^'^^'a'^ with equality if and only if 7 = t^(a), and for all 
a £ (0, 1] we have n Za ^ $ precisely when 7 — r(a), in which case C Za- 

2. Monotonicity of r. We now show that the function r thus defined is non-decreasing. Let 
us suppose that ai,a2 G [0, 1] with r(ai) < Tr(a2); this implies in particular that a2 is nonzero. 



By the preceding result we have ^(ai) 



- „S(t(Qi)) 



,S(r(a2))^^("2) 



e-— '^2- > e^(^("i))a^("^'. Consequently a^/"-)-^^"^) < e«(-("i))-S(^("=)) < 
and since v{a2) — r(ai) > we deduce that ai < a2- We conclude that if < ai < 0:2 < 1 then 
necessarily r(ai) < r(Q!2) and therefore r is non-decreasing as required. 

3. Continuity of r. We may now show that r is continuous. Given uq £ (0, 1] let r_ be the limit 
of x{a) as a ao from the left, which exists since r is monotone. For every a G (0, 1] we have 
g{a) = e^^^^°'''^a^'^"\ By Lemma [3.21 and Proposition 16.11 g and S are continuous, so taking the 
left limit at ao yields e'^^'^'^^^^^ao^""' = g{ao) ~ e^^'^^^a^f . Since r(a) is the unique value for which 
this equality may hold we deduce that v{ao) = as required. Similarly for every ao £ [0, 1) the 
limit of r(a) as a — ctQ from the right is equal to v{ao), and we conclude that r is continuous. 
Since r(0) = and r(l) = 1/2 as a consequence of Proposition 16.11 and we have shown that r is 
continuous and monotone, we deduce that r maps [0, 1] surjectively onto [0, i] as claimed. 



Consequently a]^"^^ 



> 

-r(ai 



gS(r(a2))^^^("2) and similarly g{a2) 



-t(Qi) 



4. 1-ratio and characterisation of extremal orbits. It remains to show that for each a 
the extremal orbits of may be characterised in terms of X^f^^) in the manner described by the 
Theorem, and that c(a) is the unique optimal 1-ratio of Aq,. In the case a = it is obvious that 
X e E is weakly extremal if and only if it is strongly extremal, if and only if x = 0°° £ Xq, and in 
this case the proof is then complete. For each a £ (0, 1], Lemma [3.61 shows that every recurrent 
strongly extremal infinite word belongs to Za, and therefore belongs to ^r(Q) by Proposition 15.51 
and the uniqueness property of v{a). 

To show that weakly extremal infinite words accumulate on X^(^a) in the desired manner we 
require an additional claim. Given a £ (0, 1], we assert that there is a unique T- invariant Borel 
probability measure whose support is contained in Za, and that this support is equal to X^(^a)- 
Indeed, let /it(a) be the unique T-invariant measure with support equal to X^^a)j the existence of 
which is given by Theorem 12. II If z/ is a T- invariant Borel probability measure with suppz^ C Za, 
define X := {x £ supp;/: x is recurrent}. It follows from the Poincare recurrence theorem that 
X is dense in suppz^ (see e.g. 27, Prop. 4.1.18]). By Proposition 15.51 everv element of X 
is balanced, and since t(a) is the unique 7 £ [0, 1] for which X-y D Za 7^ 0, it follows that 
X C X^(^a)- We conclude that suppi' C X^(^a) and therefore v — /^^(q) since the restriction of 
T to X^(^a) is known to be uniquely ergodic, which proves the claim. By Theorem 12.11 we have 
fJ-z{a){{^ G E: xi = 1}) = and we may now apply Lemma 13.81 to see that if a; G S is weakly 
extremal for A^, then (1/n) X]fc=o dist(a;, — )■ and ij(7r„(a;)) — r(a) as required. 

It remains only to show that for each a £ (0, 1], every x £ X^(^a) is strongly extremal in the strict 
fashion described by (|2.ip . Given any compact set K C (0, 1], choose an integer Nk large enough 
that Nk > max{ [r(Q!)~^] , [(1 — r(Q!))^^] } for every a £ K, and let Mk > 1 be the constant given 
by Lemma [3.31 Let a £ K and x £ ^r(Q)- By Lemma [4.11 we have 0^^,1^^ 7^ x, and since 
X C Zq, we have ||^'^"^(x,n)||Q = g{a)^ for all n > 1. Applying Lemma [5. 11 and Lemma [3731 



g(a)" _ 1 



< 



A^^'Hx,!!) 



A'-°'\x,n) 
< Mk 



< 



2Nl 



A^"\x,n) < p(A^'^\x,n)) 



A'^°'\x,n) <MKg[aT<2MKNlg{ay- 



so that (12. ip holds with Ck '■— 2MkN^. In particular this shows that for each a £ (0, 1], every 
x £ X^(^a) is strongly extremal. The proof of the Theorem is complete. 
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8. Proof of Theorem 11.11 

Recall from Proposition 16.11 that there exists a continuous concave function S : [0, 1] — > K such 
that for each 7 e [0, 1], 

5(7)= lim — log |||^(a;, n)||| = lim — log p{A{x , n)) 

uniformly for x E . We saw in the course of the proof of Theorem 12.31 that the function 
r: [0, 1] — [0, i] is characterised by the fact that ^(q;) > e^^'^^a'' for all a,7 G [0, 1] with equality 
if and only if 7 = r(a). Readers who have skipped the proof of Theorem 12.31 mav note that this 
characterisation can be deduced easily from the definition of S and the statement of Theorem l2.3l 
The proof of Theorem 1 1 . 1 1 operates by exploiting the concavity of S and the above relationship 
between S and r to compute a value a* G [0, 1] such that r(a*) ^ Q as the limit of a series of 
approximations. We begin with a result from convex analysis. 

Lemma 8.1. For each 7 e have r^^(7) — {ao} with ao G (0,1] if and only if S is 

differentiable at 7 and 5" (7) = — logag. 

See Figure [T] for a graph of S{'^) along with the tangent line of slope a*. 




Figure 1: Graph of 5(7), and tangent line at 7 « 0.3819660 ... of slope - log(0. 74932 . . . ) 

Proof. Recall that if / : [a, &] R is a concave function then 77 S K is called a suhgradient of / at 
z G [a, b] if f{y) < f{z) + rj{y — z) for all y € [a, b]. Furthermore, / is differentiable at z E (a, b) 
with f'{z) — rj ii and only if 77 is the unique subgradient of / at z (see for example [131 Thm 25.1]). 
To prove the lemma it therefore suffices to show that for each 76(0,^), yySRisa subgradient 
of S* at 7 if and only if £ (0, 1] and t{e^^) — 7. 

Let us prove that this is the case. For every a,j £ [0, 1] we have e'^'^'^'^^^^a"^'"^ > e'^^'^'-'a'' with 
equality if and only if 7 = r(a). For each fixed a G (0, 1) it now follows by a simple rearrangement 
that — log a is a subgradient of S at r(a) . Conversely, suppose that r/ e R is a subgradient of S at 
some 70 G (0, ^). By Proposition 16. 11 S is monotone increasing on the interval [0, i] and therefore 
we must have 77 > 0. Since 77 is a subgradient we have e'^^'''"^"''^'' > e^'^"'^~^'^ for all 7 e [0, 1], and 
since € (0, 1] it follows that 70 = t(e~'') as required. □ 
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The following corollary is not needed in this paper but since it is straightforward, we believe 
it's worth mentioning. 

Corollary 8.2. The function S is strictly concave on [0, 1] and strictly increasing on [0, 1/2]. 

Proof. If S were not strictly concave, there would be an interval (71,72) such that S would be 
linear on this interval. Hence S' would be constant on (71,72) which would mean, in view of the 
previous lemma, that r~^(7) would be constant for all 7 G (71,72)- This contradicts r being well 
defined (Theorem 12. 3|) . whence S is strictly concave. 

Since S is non-decreasing, continuous and strictly concave on [0, 1 /2] , it is strictly increasing. □ 

Throughout this section we let := ^-^^ denote the golden ratio. Recall that a real number 
7 is said to be Liouville if for every fc > there exist integers p, q such that < \j — p/q\ < ^/q^- 
A classical theorem of Liouville asserts that no algebraic number can be Liouville (see, e.g., [211 
Theorem 191]). In particular 0"^ is not Liouville. 

Lemma 8.3. Let 7 e [0, ^] and suppose that 7 is an irrational number which is not Liouville. 
Then there exists a unique a G [0, 1] such that v{a) = 7. 

Proof. By Theorem 12.31 the function r is surjective and monotone, so the set r~^(7) is either a 
point or an interval. To show that this set cannot be an interval, we shall suppose that there exist 
ao G (0, 1) and £ > such that x{a) = 7 for all a S [e~^ao, e^ao], and derive a contradiction. 

Since 7 is irrational but not Liouville, we may choose an integer fc > such that for all integers 
p,q with q nonzero we have I7 ~ p/q\ > l/?*^. A theorem due to the second named author [38l 
Thm 1.2] implies that for every r > 0, 

maxjp (A^°"'\x,m)j : a; e E and 1 < m < 7i| gi(ao) + O 

in the limit as n — > 00. In particular it follows that if n is some sufficiently large integer, then 
there exist an integer m and an infinite word x € T, such that 1 < m < n and 

(8.1) p (^("°) (x, m)) > (^1 - j g{a) > g-^" g{a). 

Let <;(7r„i(x)) = p/q in least terms; we shall suppose firstly that ^ — > 0, the opposite case being 
similar. By hypothesis we have g{\ao) — e^^''^^'^°'>\Xaoy'^°'°^ = e^ {XaQ)'^ = X'^ g{ao) for every 
A e [e~^,e^], and also ^ — 7 = || — 7I > q^'^ > n^''. Combining this with (|8.1I) and Lemma [01 
we obtain 

g{e'ao) >p[A^'' "°)(a;,m)J = e'P/'^ p \^A^°'°\x,m)j 

> g(^ao) = e<^'/«~^""^')e(e^ao) 

> gie^ao) , 

a contradiction. In the case | — 7 < we may similarly arrive at the expression 

g{e-'an) > e-'P/'^-'"'" giao) = e'i^-P/"-"'") g (e-'ao) >g{e-'ao) 
which is also a contradiction. The proof is complete. □ 

Let (Fn)J^o denote the Fibonacci sequence, which is defined by Fq := 0, Fi 1 together with 
the recurrence relation Fn+2 ■= Fn+i + Fn, and recall that F„ = (0" — (— 1/0)")/a/5 for every 
n > 0. Define a sequence of integers (t„)J^q by tq := 1, ti = T2 :— 2, and t„-|_i TnTn-i — t„_2 
for every n > 2. Finally, define a sequence of matrices {Bn)'^^i by Bi Ai, B2 := and 
Bn+i '■= BnBn~i for every n > 2. The key properties of Fn, Bn and t„ are summarised in the 
following three lemmas. 

Lemma 8.4. For each n > 2 the identities S{Fn-2/Fn) ~ F^^ \ogp{Bn) and FnFn-i~Fn+iFn-2 = 
(— 1)" hold, and the value lies strictly between Fn-2/Fn and F„_i/F„+i. 
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Proof. Define a sequence of finite words by u^^) ■= 1, i*(2) 0, and U(„-|-i) '■— U(n)''J'(n~i} for every 
n > 2. Clearly we have = -B„ for all n > 1. A simple induction argument shows that each 

is a standard word in the sense defined in Lemma |4.3[ and that |u(„)| = F^, = i^„_2 

for every n > 2. By Lemma [4.31 and Lemma IG-Bf i) we therefore have [u(„)]°° £ ^F„_2/i^n ^^'^ 
consequently S'{F„_2/-F„) — F:^^ logp(^(u(„))) = F~^ logp(i3„) for every rt > 2 as required. The 
remaining parts of the lemma follow from the fact that the fractions F^-ilF^ are precisely the 
continued fraction convergents of <^~'^ ■ Alternatively these results can be derived from the explicit 
formula for □ 

Lemma 8.5. For each n > 1 we have tr_B„ — t„. 

Proof. By direct evaluation the reader may obtain tr Bi = tv B2 = 2 = ti = T2 and tr ^3 = 3 = T3 , 
so it suffices to show that the sequence (tri3„) satisfies the same recurrence relation as (t„) for all 
n > 3. Let us write 



Cn a^i 

for each n > I. Notice that we have a„(i„ — 6nC„ = det i?„ = 1 for every n, and for each n > 2 the 
definition i?n+i := BnBn-i implies the identity 

Cn+l dn+1 J \ c„a„_i + 0?„C„_1 C„6„_i + C?„C?„_i 

Fix any n > 3. By definition we have 

tr Bn+i = an+i + dn+i = a„a„_i + 6„c„_i + c„5,i_i + 

and 

(trS„)(trB„_i) 

SO we may compute 

2) + ari-l(Cn-lferi-2 + C^ri-l'iri-2) 
— C„_i(a„_i6„_2 + bn-ldn-2) — &ri-l(Cn-ia„_2 + rfri-lCn-2) 
= 0-n-2{an-ldn-l — &n-lCn-l) + C?n-2 (On- l^n- 1 — &n-lC„_i) 
= an-2 + dn-2 = tri3„_2, 

which establishes the required recurrence relation. □ 

Lemma 8.6. There exist constants 5i,S2 > such that 

|logT„-logp(i?„)| =0(e-^i^") 

and 

e 



log (1 - 

Tn+lTn 



in the limit as n 



Proof. It is clear that Fn-2/Fn 4>~^ using the formula for ivi, and since S is continuous it 
follows via Lemma 18.41 that F,^^ logp(i3„) — >■ S{(j)~'^) > 0. Since deti?„ = 1 and i?„ is non- 
negative, the eigenvalues of i?„ are p{Bn) and p{Bn)^^ respectively, so for each n > 1 we have 
T„ = tri?„ = p{Bn) + p{Bn)~^, where we have used Lemma 18.51 Hence, 

where we have used the elementary inequality log(l + x) < x which holds for all real x, and this 
proves the first part of the lemma. 

It follows from this result that lim„_j.oo F~^ logr„ = S'((/)~^). We may therefore apply this to 
obtain 
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lim ± log [^^) = Jm^ logr„-i - logr„+i - -1 logr„ 



= ' - - 1) - -25(r') < o, 

from which the second part of the lemma follows easily. □ 

Proof of Theorem We will show that S"((/)^^) = —log a*, where a* satisfies the product 
and limit formulas given in the statement of the Theorem. By Lemma 18.11 this implies that 
r(a*) = (/)~^ ^ Q, and by Theorem 12.31 this implies that does not satisfy the finiteness 
property. 

By Lemma [8.11 together with Lemma 18.31 the derivative S"((/)~^) exists and is finite. Using 
Lemma 18.41 and Lemma 18.61 we may now compute 



— lim — 



F„+i F„ 

i-log/3(B„+i) - -^logp(B„ 



= lim 



Fn log p {Bn+l) - Fn+1 log p (B„) 



Let us define 



FnFn-l — Fn+lFn-2 

lim (-l)"(F„logp(B„+i) - i^^„+ilogp(B„)) 

n— >-oo 

lim (-1)"(F„ logT„+i - Fn+i logr„). 

n— ^OO 

a, := = lim ^ 

which yields the first of the two expressions for a*. We shall derive the second expression. Let us 
write an '■— (''")f"^V''"ri+i)'^^^ ^'^^ each n > 1 so that a* = lim„_j.oo c^n- Applying the recurrence 
relations for and (r„) once more, we obtain for each n > 1 

^^^^..+2/ f„+a(-i)"^' , p , (-1)" 



(-1)" I F, 



1 - 



Jn+lTn J \ Tn+lTn J \ T„+iT„ 

Since ti = T2 — 2 and Fi — F2 — I we have ai = 1. Using the formula above we may now obtain 
for each > 2 

N-l N-1 , s (-l)"F„+i 

T <-^n T \ 'n+1 ' n , 

n— 1 n— 1 ^ 



It follows from Lemma [8. 61 that these partial products converge unconditionally in the limit N — > 
00, and taking this limit we obtain the desired infinite product expression for a*. □ 

Remark 8.7. The proof of Theorem 11.11 may be extended to give an explicit estimate for the 
difference |a* — a^rl as follows. Note that for each n > 3 we have 1/3 < Fn-2/Fn < 1/2 and 
therefore, by Proposition 16. 11 



F-i logr„ > logp(B„) = S (^^'^ 



^ ^ -IX logp(AgAi) ^ log(2 + 73) 



24 



KEVIN G. HARE, IAN D. MORRIS, NIKITA SIDOROV, AND JACQUES THEYS 



On the other hand, if we define a sequence (t„)^]^ by f i = f2 = ti = T2 = 2 and f„+i := f„f„_i 
for n > 3, then it is clear that t„ < f „ = 2^" for every n > 1. Combining these estimates yields 



|logajv - log a* I < ^ Fn+i 



oo 

< 2 Fn+1 

oo 



log 1 - 



Tn-1 
Tn+lT„ 



< 



2E 

n=N 



Fn+lTn-1 
Tn+lTn 



2^- 



(2 + y3)^"+2/3 



< Cl ^ + 1) 



n=N 



< 



< 780 I - 



for all > 3, where 



C,:^i(^±^ = 2.77475, 



0.72441 



V5 ■ ■ V(2 + V3)^ 

It follows in particular that the value ais := Tj^^/tj'!}*' satisfies ja* — aisj < 10"^^, which yields 
the approximation given in the introduction. 

9. Further questions 

1. Is it true that a* is irrational or transcendental? The fast rate of convergence of the sequence 

(-1)" 

suggests that a* is probably irrational; however, perhaps unexpectedly, this rate 

itself is not fast enough to claim this. Roughly, to apply known results (see, e.g., [39]), we need t„ 
to grow like with A > 1 and B > 2. Then Theorem 1 from the aforementioned paper would 
apply. In our setting however we "only" have B = cj) < 2. 

A good illustration how tight the quoted result is is the famous Cantor infinite product 



n 



1 



1 

2^ 



equal to 2, despite its "superfast" convergence rate. However, a similar product 



n=0 



is indeed irrational. We conjecture that a** = r^^(l — 1/V2) (which corresponds to the substitution 
— > 001, 1 — > similarly to a* corresponding to the Fibonacci substitution — > 01, 1 -> 0) is 
irrational. 

2. Is r^^(7) always a point when 7 is irrational? We know this to be true if 7 is not Liouville (i.e., 
for all irrational 7 except a set of zero Hausdorff dimension) but the method used in Lemma 18.31 
is somewhat limited. We hope to close this gap in a follow-up paper. 

3. If the answer to the previous question is yes, then is it true that r~^(7) ^ Q whenever 7 ^ (Q)? 
This question is pertinent to a conjecture of Blondel and Jungers, which says that the finiteness 
property holds for all matrices with rational entries [26]. Our model should not, therefore, yield a 
counterexample to this conjecture. 

4. Is r^^(7) always an interval with nonempty interior when 7 is rational? It was shown by 
the fourth named author in his thesis [TT that (i) = [|, l] , and all other known examples 
indicate that the answer is positive. However proving this for a general 7 € Q seems like a difficult 
question. 

5. Does the set of all a such that r(a) ^ Q have zero measure? Does it have zero Hausdorff 
dimension? Analogues of these properties are claimed for Bousch-Mairesse's example but proofs 
are not given [8]. 
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We conjecture that the graph of r is a devil's staircase with the plateau regions corresponding 
to {7 : r(7) e Q} - see Figured 



0.5 



0.4 



0.3 



0.2 



0.1 




0.2 0^4 0.6 0^8 i 1.2 



Figure 2: Graph of c(7) 

Remark 9.1. Between the time of submisssion and present, some progress has been made on some 
of the questions above. Interested readers are welcome to contact the authors above to find out 
the current progress on these problems. 
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